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Abstract — The capacity region of the finite-field multi-way 
relay channel (FFMWRC) with independent private messages 
was established by Ong, Johnson, and Kellett (IT-2011). In 
this paper, we extend the capacity results to include pairwise 
common messages (each known to two nodes) in addition to 
private messages (known to only one node). We first show that 
the functional-decode-forward coding scheme that achieves the 
capacity region of the FFMWRC with independent messages is 
strictly suboptimal when there are common messages. We then 
construct a new coding scheme that achieves the capacity region 
of the FFMWRC with pairwise common messages. 

I. Introduction 

The multi-way relay channel models data exchange among 
L wireless users through a relay (e.g., base station or satellite). 
The two-way relay channel (i.e., L = 2) has been intensively 
studied [1], [2|, [3|. A canonical generalization is the multi-way 
relay channel (MWRC), L > 2, where each user sends its data 
to all other users. 

The MWRC is commonly studied in the context of indepen- 
dent messages 0, 0, 0, Q, 0. Recently, we investigated 
the MWRC with arbittarily correlated messages where the 
uplinks (i.e., the channels from the users to the relay) are 
orthogonal (i.e., non-interfering) [9] . The coding scheme 
proposed therein requires non-interfering uplinks. In this paper, 
we study the MWRC with common messages, i.e., some 
messages are known to two users. While common messages 
is a special form of correlated messages, the uplink here is 
interfering (the users' inputs to the channel interfere with each 
other), modeled by the finite-field additive channel. 

We derive the capacity region of the finite-field MWRC 
(FFMWRC) with common messages. To this end, we con- 
struct an optimal coding scheme based on functional decode- 
forward 13 where the relay decodes functions of the users' 
messages on the uplink, and forwards the functions back to the 
users on the downlink (i.e., the channel from the relay to the 
users). The main difficulty is to determine an optimal function^] 
that the relay should decode, taking into account that some 
messages are known to two users. 

A. Channel Model 

Fig. [1] shows the FFMWRC. Denote the users by 1, 2, ... , 
L, and the relay by 0. Further denote the channel input from 
node i by Xi, and the channel output received by node i by 

Having the relay decode all the messages is strictly suboptimal in the 
FFMWRC (5). 




Fig. 1. The finite-field multi-way relay channel, where the solid lines represent 
the uplink and the dashed lines represent the downlink 

Yi. The FFMWRC is defined as follows: 

L 

Uplink: Yq = X 1 © X 1 © • • • © X L © N ± X t © N , 

(1) 

Downlink: p Yl ,Y 2 ,...,Y L \x (2/1,2/2, ••• ,Vl\x ), (2) 

where the uplink variables Yo, Nq, and Xi for all i G 
{1, 2, . . . , L} are elements of a finite field of order F, © is 
the addition in the field, and Nq is arbitrarily-distributed noise. 
For the FFMWRC considered in this paper, we not restrict any 
channel model for the downlink. This general model includes 
the finite-field downlink channel as a special case. 

The finite-field model simplifies the additive white Gaussian 
noise (AWGN) channel — commonly used for the wireless 
environment — in the following ways: (i) there is no input 
(power) constraint for the finite-field model; and (ii) we know an 
optimal linear coding scheme for the multiple-access finite-field 
channel]^] It nonetheless retains two important characteristics of 
the AWGN channel: (i) the channel is corrupted with additive 
noise; and (ii) the channel inputs interfere linearly with each 
other, i.e., the transmitted signals are added up at the receiver^] 
Coding schemes designed for the FFMWRC 0, have been 
applied to — and shown to be optimal in — the symmetrical 
AWGN MWRC ifTTl . 

B. Common Messages and Rates 

We use the channel n times for the users to exchange 
pairwise common messages in addition to private messages. 

2 Though the lattice codes are optimal for the point-to-point AWGN channel, 
they may not be optimal for the multiple-access AWGN channel where the 
receiver is to decode a linear combination of the transmitters' messages 1 10|. 

3 These are two important characteristics of multiple-access channels in 
general: (i) the channel is noisy, and (ii) the channel inputs interfere with each 
other. 



The terms "private" and "common" here are defined with 
respect to the senders, i.e., private messages are known to 
only one sender,and common messages are messages known 
to two senders 4 Formally, there are L + (2) = L( - L 2 +1 ' > 
independent messages denoted by \Wx '■ I = {i} or I = 
for all distinct i,j G {1, 2, ... , L}|, where each 
Wx G {1,2,..., 2 nRx }. Each user i knows its private message 
Wi and common messages : j G {1, 2, . . . , L} \ 

where each Wij is the message user i shares with user j. 
By definition, Wij = Wjj. We define a rate tuple as R = 

(Rl,R 2 , • • • , Rh,R\a, R\3, ■ • • j Rl,L,R2,3, Ri,ii ■ ■ ■ , R2,L, 

. . . , Rl-i l)> which is a non-negative real vector of length 

£(£+1) 
2 

We use square brackets to indicate the variable associated 
with a channel use. The noise term No[t] is independent 
and identically distributed (iid) for each channel use t E 
{1, 2, . . . , n}. A length-n block code consists of the following: 

1) Encoding functions at each user i E {1, . . . ,L}\ 

Xi[t] = f it (Wi, (Wi,j)ie{i,2...,i}\i, (^H)re{i,2,...,t-i} 
for all t E {1,2,..., n}. 

2) Encoding functions at the relay: 
^o[*] = /ot((loM) Te{1>2 ,... >t _i } ), 
for all t E {1,2,..., n}. 

3) A decoding function at each user i E {1,2,..., L}: 
{(Wj(i))j e {i,2,...,L}\{i}, {W jk (i))j,ke{i,2,...,L}\{i}) 

= 9i ((^M)re{l,2,.. .,«})■ 

Here, Wxti) is the estimate of Wx by user i. A decoding 
error occurs if some user wrongly decodes some messages, i.e., 
WxU) 7^ Wx for some I and some i. This means each user 
needs to decode all messages. Assuming that each message is 
uniformly distributed, we denote the probability of decoding 
error by P e . We say that the rate tuple R is achievable if the 
following is true: for any e > 0, there exists a block code such 
that P e < e. The capacity region is the closure of all achievable 
rate tuples. 

C. Previous Work with Only Private Messages 



For the FFMWRC with only private messages (i.e., R 







for all we have the following capacity result, which is 

a straightforward extension of the FFMWRC with finite-field 
downlinks [5] (refer to Appendix |A| for proof): 

Lemma 1: Consider the FFMWRC with only private mes- 
sages, where each user i transmits an independent private 
message Wi. The rate Rprivate-oniy = (Ri, R2, ■ ■ ■ , Rl) is 
achievable if there exists some p(xo) such that the following 
is true for all i 6 {1, 2, ... , L}: 

Rj < min{log 2 F - H(N ), I(X a ;Y t )}. (3) 

r : / : :<: 

Conversely, if R pr i V ate-oniy is achievable, then there exists some 
p(xo) such that ([3]) is satisfied all i E {1, 2, . . . , L} with a 
non-strict inequality (i.e., < instead of <). 

4 The reader is not to be confused with common messages denned with 
respect to the receiver, which appear commonly in the context of broadcast 
channels (TJ, [B] p. 563]. 



In this paper, we refer to the coding scheme that achieves 
the capacity region in Lemma [TJ as functional-decode-forward 
for private messages (FDF-P). In brief, we let two users 
transmit at a time and have the relay decode a function (in 
this case, the finite-field addition) of the messages. We cycle 
through different user pairs that transmit in the following order: 
(1,2),(2,3),...,(L-1,L). 

D. The FDF-P Scheme is Suboptimal for Common Messages 

Although we can use the FDF-P scheme when there are 
common messages, the scheme is not always optimal. Consider 
an FFMWRC with three users, where 

1) X U X 2 ,X 3 ,Y ,N E {0,1,2,3}, i.e., F = 4, 

2) Pr{N = 0} = Pr{iV = 1} = 1/2, 
Pr{N = 2} = Pr{iV = 3} = 0, 

3) x ,Yi,r 2 ,r 3 e {0,1}, 

4) Y x = y 2 = Y 3 = X Q . 

To transmit private and common messages using FDF-P, we 
first split each common message Wij into two parts, say W[ 
. and WL and let user i transmit W' and user j transmits W\ as 
if they were private messages. More specifically, we split the 
message W1.2 into independent sub-messages, i.e., W[ of rate 
R[ and of rate R' 2 , where R[ + R 2 = R\. 2 . Similarly, we 
split the common messages (i) W\£ into W[' and W 3 with 
rates R'{ and R' 3 ' respectively, and (ii) W^,3 into W 2 and W 3 
with rates R' 2 ' and R' 3 respectively. Doing this, each user i will 
need to transmit (Wi, W[, W") to the other two users at the 
rate n = R, t + i?- + Rf-, where R[, R'! > 0. 

Suppose that the following rates are achievable by FDF-P: 
Ri = R 2 = R 3 = 0.4 - 8, R h2 = 0.2 - 6, R 1>3 = i? 2 , 3 = 
0.15 — <5 for some small 6 > 0. From Lemma [T] we have 

n +r 2 = R 1 +R 2 + R 1<2 + R'l + R 2 ' 

< log 2 F - H(Nq) = 1, (4) 



R'l + Rn 



(Rl,3 - R3) + (-^2,3 - R' 3 ) 

<1-Rx+R 2 + Ri, 2 = 35, 
R 3 + R 3 > Ri,s + ^ 2 ,3 - 3(5 = 0.3 



5(5. 



(5) 
(6) 



So, r x +r 3 = R 1 + R 3 + R' 3 + R'^ + R 1 + R{ > 1.1-7(5+7?; + 
R'{ > 1.1 - 7(5. Choosing S = 0.01, we have r x + r 3 > 1.093. 
But from Lemma [T] we must have r\ +r 3 < 1 (contradiction). 
Hence, these rates are not achievable using the FDF-P, but they 
are achievable using the new scheme proposed in this paper. 

The shortcoming of using FDF-P to send common messages 
is that the scheme ignores the fact that another user (besides the 
sender) knows the sub-messages, for example, user 2 knows 
W[. We will propose a nested FDF scheme that rectifies this 
problem. 

We have previously shown that the complete-decode-forward 
coding scheme, where the relay decodes all the messages, is 
strictly suboptimal for the FFMWRC with private messages 0. 
Since the FFMWRC with common messages includes that 
with private messages as a special case, this coding scheme is 
also strictly suboptimal. So, in this paper, we propose a new 
FDF scheme, where the relay decodes some functions of the 



messages. The challenge is to design optimal functions that 
the relay decodes. 

E. Main Results 

Each user a needs to decode all {Wx : a £ 1}. To simplify 
notation, we define the sum rate associated with node a as 



Applying Lemma [2] to the FFMWRC with S = {1,2,...,}\ 
{a} and S c — {0, a} for some a E {1,2,..., L}, we have 



Ra < I(X{l,2,...,L}\{a,}'>Yo,Ya\Xo,X a ) 



(10a) 



R. 



£ A 



E 

ie{l,2,...,L}\{a] 



E 

{t,j'}C{l,2,... 



R 



1,3- 



(7) 



L}\{a} 



= H(Y \X , X a ) + H(Y a \X , X a ,Y ) 

- H{Y \X {0At ..., L} ) - H(Y a \X {0Ai ... M ,Y ) (10b) 
< H(Y ) + H(Y a \X {0jh ... jL} ,Y ) 

- H(N ) - H(Y a \X {0tly „ tL} ,Y ) (10c) 
= H(Y ) - H(N ) (lOd) 
<log 2 F-#(iVo), (lOe) 



We will propose a new nested FDF coding scheme (see , mn. ■ a ■ a tv v \ v v 

. , F F & v where dlOcb is derived as (Yn, I/i 2 ta I — X n — Yi 



Sec. IIIi that achieves the capacity region of the FFWMRC 



with common messages, given in the following theorem: 

Theorem 1: Consider an FFMWRC with common messages. 
The rate tuple R is achievable if there exists some p(x ) such 
that the following is true for all a g {1, 2, . . . , L}: 



R* < min { log 2 F - H(N ), I(X ; Y a )}. 



(8) 



Conversely, if R is achievable, then there exists some p(xo) 
such that ([8} holds with a non-strict inequality for all a. 

Major differences between the new scheme, i.e., the nested 
FDF, and the existing FDF-P are as follows: 

1) In FDF-R the function that the relay decodes is pre- 
defined; in the nested FDF scheme, a function is chosen 
and then modified using an algorithm that we propose. 
As a result, the functions vary with the rates of the 
sub-messages. So, we might need different functions to 
achieve different rate tuples in the capacity region. 

2) In FDF-R the users transmit their messages, and the relay 
decodes functions of the messages; in the nested FDF 
scheme, the users transmit functions of their messages, 
and the relay decodes functions of the user's transmission 
(which can be functions of functions of the messages). 

3) FDF-P it not always optimal when there are common 
messages; the nested FDF is. 

II. Converse 

The capacity outer bound, i.e., the converse in Theo- 
rem [T] follows directly from the following cut- set bound fPJl 
Thm. 15.10.1]: 

Lemma 2: Consider a multiterminal network with nodes 
{1,2,..., K} where each node i sends an independent mes- 
sage W( % ~^i> at the rate R^~^ 3 ^ to each other node j, for 
j € {1,2,..., if} \ {i}. If the rates {R^^} are achiev- 
able, then there exists some joint probability distribution 
p(x\,X2, ■ ■ ■ , xk) such that 



V" < I(X S ;Y S *\X S *) 



(9) 



for all S C {1, 2, . . . , K} where X s = 



{1,2,...,£} 

forms a Markov chain, conditioning cannot increase entropy, 
and the channel noise Nq is independent of the channel inputs 

X{0,1,...,L}- 

Similarly, setting S = {0, 1, 2, . . . , } \ {a} and S c = {a}, 
we have 



Ra < 



,L}\{a}'-,Y a \X a 



= H(Y a \X a ) - H(Y a \X {0ily „ >L} ) 

<H(Y a )-H(Y a \X ) 

= I(X ;Y a ), 



(11a) 
(lib) 
(11c) 
(Hd) 



where ( |llc[ ) is derived because (Yq, X^ 2 l}) — — 
^{1,2, ...,£} forms a Markov chain, and as conditioning cannot 
increase entropy. 

If the rate tuples R is achievable, then there exists some 
p(xq,Xx, . . . , xl) such that ( |10e[ ) and ( lld i are true for all 
a G {1,2, . . . ,L}. Note that the RHS of (jTTd) depends only on 
the marginal pmf p(xq). This proves the converse in Theorem[T] 

III. ACHIEVABILITY 

In this section, we will prove the achievability of Theorem [T] 
To simplify notation, we assume (without loss of generality) 
that 

Rf > Ra (12) 

for all a G {2,3,..., L}. This means, user 1 needs to decode 
at the highest total rates. For any rate tuple R, we can always 
re-index the nodes to get ( fT"2| ). With this simplification, we 
need to show that if 



Rf <\og 2 F-H(N ), 
and if there exists some p(xo) such that 

R% <I(X ;Y a ), for all a € {1,2,. 



(13) 



,£}, (14) 



then the rate tuple R is achievable. Note that ( fT3] > is a constraint 
involving the uplink variables, and ( fT4] ) the downlink variables. 

We can think of the first constraint as the relay needing to 
decode at rate Rf. In general, the sum rate of all messages (i.e., 
2~2x Rz) i s sa "i ct ly higher than Rf. This is why the complete- 
decode-forward coding scheme is suboptimal 0, as it requires 
that the relay to decode all the messages, and this imposes a 
constraint J2x Rx < log 2 F — H(No), which is stricter than 
( fT3| . Our aim is to design codes such that the relay needs to 
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Uplink Message Transmission 
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row L 








* 








* 








* 




* 



decode only functions of the messages, i.e., at a lower rate of 

*m 

A. Using Linear Codes on the Uplink 

On the uplink, we will use linear block codes of the form 

x = (uQ<S)®q, (15) 

where the codeword x, the message u, and the dither q are 
row vectors, and G is the generator matrix. All elements are 
from the finite field of order F. In addition, each element 
in G and in q is independently and uniformly chosen over 
the finite field. Here, denotes matrix multiplication, and © 
symbol-wise addition in the finite field. 

We now state a result of random linear block codes on 
finite-field channels [5 Thm. 3]: 

Lemma 3: Consider the uplink Q. Each user encodes its 
message TJ i (a length-fc finite-field vector) to Xi (a length-n 
vector) using the linear code of the form \\5) . The users use 
different dithers but a common generator matrix. The receiver 
can decode ©^Li V \ from its n received channel outputs Yq 
with an arbitrarily small error probability if n is sufficiently 
large and if 

fclog 2 F 



<log 2 F-H(N ). 



(16) 



B. Messages to be Transmitted on the Uplink 

In order to utilize the linear block codes, we consider each 
message Wx to be a finite-field vector of length k%. We have 
used the bold-faced symbol to emphasize that the messages 
are vectors. The length of the vector is thus related to the rate 
by 2 nRl = F kl . Similar to Q, we define fej = nR% / log 2 F, 
i.e., the total number of finite-field symbols to be decoded by 
user a. 

Our aim is to construct optimal functions that the relay 
should decode on the uplink. We will use linear codes across 
multiple blocks, where in each block, the relay decodes the 
addition of the (finite-field vector) messages transmitted in 
that block. Equivalently, we will determine what messages the 
users transmit in each block. 

The transmissions are described using Table [I] In each block, 
i.e., vertically-aligned cells from rows 1 to L, the users transmit 
messages assigned to the cells. Each cell contains multiple 

5 An additional requirement is that when the relay broadcast the function 
back to the users, each user can decode the messages that it requires. 



columns; each column in the cell corresponds to a message 
symbol (columns are not drawn in the table). We refer to the 
number of columns in each cell as the block size, which is set 
to be the message length in the cell in row 1. 

We construct Table [I] as follows: In row 1, we place all the 
messages that user 1 requires, i.e., {Wx ■ 1 ^ I}. We identify 
each block by the subscript, 1, of the corresponding message 
Wx in row 1; the block size is thus k%. 

For all the L(L — l)/2 messages in row 1, user 
a G {2, 3, . . . , L} knows [L — 1) of them a priori, namely, 
{W a ,WaJ ■ J G {2,3,..., L} \ { a} I (messages with 
subscript "a"). For these messages, we put an asterisk in 
each corresponding cell (i.e., in the same block) in row a. In 
these asterisked cells, we will assign messages that (i) user 
a requires and (ii) user 1 knows. More specifically, we will 
assign j Wi, Wi tj : j £ {2, 3, ...,£} \ {a} j to these cells 
(replacing the each subscript "a" in row 1 by "1"). The idea 
is to let both users 1 and a obtain their respective required 
messages from the sum. These (L — 1) blocks in rows 1 and 
a are extracted out as follows: 



w a 






W a , a -i 


w a , a+1 




W a . L 


Wi,Wi, 2 ,.. 




a-l, Wi. a - 


f l, . . . , Wi,l 





We spread the messages jw^W^j : j G {2, 3, ...,L}\ 

{a}> across the asterisked cells in row a (because the messages 
on different rows have different lengths, and they may not align 
at the block level). We now show that the L(L— 1)/2 messages 
on row a can always fit into the corresponding [L— 1) asterisked 
cells. From ( |12) , we have 

k a + ^2 (kj + k at j)+ ^2 

je{2,3,...,i}\{o} i,je{2,3,A,-,L}\{a} 

>h+ ^2 (k j +k 1J ))+ fc ^-< 

jG{2,3,...,L}\{a} i,je{2,3,4,...,L}\{a} 

(17) 

which gives 

k a + 22 k a ,j>h+ ^ (kj + faj). 

je{2,3,...,i}\{o} je{2,3,...,i}\{o} 

(18) 

If the above equality is strict, the excess columns in the 
asterisked cell(s) will be left empty. 

We repeat this for all a G {2,3,..., L}. Doing this, in every 
block I, all rows a G I have asterisked cells. 



C. Some Properties of Table [7| 

We now prove a few properties of the aforementioned 
function construction. 

Proposition 1: User 1 knows the messages in rows 2 to L 
a priori. 

Proof: In each row a, we only assign either W\ or W\,i 
in the asterisked cells. ■ 
Proposition 2: Once user a, for some a € {2,3,..., L}, 
knows the messages in row a, it also knows the messages in 
all other asterisked cells in all other rows, i.e., rows b for all 
bE{2,3,...,L}\{a}. 

Proof: Suppose that user a has decoded the messages 
in row a, i.e., {Wx, W lti : i € {2, 3, ...,£} \ {a}}. By 
definition, it knows W\. a a priori. Since, any message in other 
rows {2, 3, . . . , L} \ {a} must be either W\ or Wi,j, user a 
knows those messages. ■ 
Proposition 3: For any a E {1,2,3, ... , L}, after decoding 
all messages in rows a and 1, user a will have decoded all the 
messages it requires. 

Proof: Recall that user a needs to decode all messages 
\W X : 2 C {l,2,...,L}\{a},\2\ < 2}. Clearly, row 

1 contains all messages that user 1 needs to decode. Now, 
we prove the proposition for each a E {2, 3, ... , L}. For 
\F\ = 1, Wi appears in row a, and {W 2 , W 3 , . . . , Wl} 
appear in row 1. For \I\ = 2, all messages {Wij : 
i,j E {2, 3, ...,£} \ {a}} = Wi appear in row 1, and 
{W ltk : k E {2,3,4,...,L} \ {a}} = W 2 in row a. So, 
Wi U W 2 = {Wij : i,j E {1,2, ... ,L} \ {a}}. ■ 

The following corollary is a straighforward consequence of 
Proposition [2] 

Corollary 1: For each user a E {2,3, . . . , L}, there are 
exactly k\ + J2ie{2 3 £}\{a} message symbols in rows 

2 to L in Table U that are unknown to user a. 

Proof: We know that the messages {W^W^; : i E 
{2, 3, . . . , L} \ {a}} are the messages in row a, and these 
messages are unknown to user a. The messages comprise ki + 
Sie{2 3 £}\{a} ^1,1 message symbols. From Proposition |2j 
we know that there are no other messages unknown to user a 
in rows 2 to L. ■ 

D. Codewords to be Transmitted on the Uplink 

With Table [I] constructed, we are ready to construct the 
uplink codewords. For each block, two users transmit — user 1 
and another user a E {2, 3, . . . , L}. 

For the first (L — 1) blocks, there is only one asterisk per 
block. For block a, for a E {2, 3, . . . , L}, user a transmits W a 
that corresponds to the cell in row 1, and simultaneously, user 
1 transmits the codeword that corresponds to the asterisked cell. 
More specifically, user a and user 1 transmits the following 
respectively: 



X a =(W a Q< 
X l = (V a Oi 



q', 



(19) 
(20) 



and G is an fc -by-n a matrix. V a is the vector defined by the 
content in the asterisked cell in block a (i.e., below cell W a ) 
in Table [I] From Proposition [T] we know that user 1 knows 
the content in the asterisked cells, and is able to transmit the 
codeword ( p0| . 

For the remaining blocks X C {(i, j) : i, j E {2, 3, . . . , L}}, 
again two users transmit (user 1 and some user a E 
{2,3, ... ,L}) simultaneously in a similar manner. Each user 
encodes a length-fcj message to a length-nx codeword as in 
(fT~9l>— (|20]> by replacing W a with W i:j , and V a with V i:j . As 
users i and j both know the message Wi.j in row 1, either of 
them transmit the codeword. User 1 transmits a function of its 
messages. In block (i, j), there are two asterisked cells — one in 
row i and one in row j. User 1 choose V i.j as follows: if the 
symbols from the two asterisked cells are the same, the symbol 
is selected, otherwise, the finite-field sum of the symbols is 
selected. 

From Lemma [3] if 

k x log 2 F 



< log 2 F - H(N ) 



(21) 



where X a , X\, q, and q' each are finite-field row vectors of 
length n a , W a and V a each are row vectors of length k a , 



and if the codelength nx is sufficiently large, then the relay 
can reliably decode the function (Wx © Vx). 

Remark 1: The nested FDF scheme is nested in the sense 
that user 1 may transmit functions of messages, and hence the 
relay decodes functions of functions of the messages. 

E. Decoding the Messages 

Define the concatenated functions that the relay decodes as 

U = (Wx Vx ■ F E {2.3 ,;\) 

(which is a finite-field vector of length kf). The relay 
broadcasts U on the downlink. The idea of the nested FDF 
scheme is that the functions U that the relay decodes and 
broadcasts must enable each user to obtain all its desired 
messages from the functions and the messages it knows a 
priori. To this end, we have selected the messages in the 
asterisked cells in Table [I] such that the following are true: 
1) Knowing all the messages in the asterisked cells a priori 
(Proposition [T}, user 1 can decode all messages in row 
1 from U, and hence obtain all its intended messages 
(Proposition [3]). 
2a) From U, each user a E {2,3, ... ,L} will first attempt 
to decode the messages in the asterisked cells in row a. 
2b) Once user a decodes the messages in the asterisked cells 
in row a, it knows the messages in all other asterisked 
cells (Proposition |2|. From U, it can then decode the 
messages in row 1. With this, user a will have decoded 
all its intended messages (Proposition [3]). 
Hence, we have the following: 

Proposition 4: Assuming that all users have decoded U, if 
each user a E {2, 3, . . . , L} can decode the messages in the 
asterisked cells in row a, then all users 1 to L can decode their 
intended messages. 

The decoding procedure now hinges on Step 2a. Let 
us focus on one user a. User a attempts to decode the 
messages in the asterisked cells in row a, denoted by 



W A A {Wx,W 14 : i £ {2,3,..., L}\ {a}}. It does so 
from the relevant parts of U, namely, {Wx ® Vx ■ 1- £ 8}, 
where 6 = {a, (a, i) : i £ {2, 3, . . . , L} \ {a}} are all the 
blocks with asterisked cells in row a. Recall that each symbol 
in Vq = {Vx ■ 2" £ 0} is the sum of distinct symbols in a 
column from rows 2 to L. The blocks in are shown in the 
table below: 



row 1 

row 2 



row L 



Wa,2 



w a 



Wl,Wl 2, • ■ ■ , ffl 



(all * cells i 



■a) 



W 



As user a knows Wq, it subtracts these from Wq © 
Vq to obtain Vq. Note that W\ comprises ki + 



E 



i€{2,3,...,L}\{a} 



A' symbols. Denote the first A' 



symbols of Vq by Vq, which are functions of only W\ 
(Proposition [2]). So, Step 2a is successful if and only if Vq 
form A' linearly independent equations. We now propose an 
algorithm to rearrange the messages in the asterisked cells to 
achieve this. 

F. An Algorithm to Shuffle the Messages 

Our aim is to get A' = k\ + J2i£{2,3,...,L}\{a} fc M linearly 
independent equations in V' B {simultaneously for all users 2 
to L). Hence, we can ignore row 1; we only need to consider 
asterisked cells in Table U Each block has at most two asterisked 
cells. For each column from row 2 to row L, we denote the 
simplified column by [^] , where a and f3 are symbols from 
the asterisked cells. If there is only one asterisk cell for that 
column, we have [ Q ] . Using this notation, we now propose 
the shuffling algorithm. 



repeat 

foreach user a = 2, 3, . . . , L do 

Consider all the non-empty columns in row a; 
Rearrange each simplified column such that the 
top element takes the symbol in row a; 
while There exists two columns [«] and [2 for 
some j3 7^ a and 7 ^ a do 

I Swap a and 7 to get Qj] and [^] ; 
end 

end 

until The while condition is not satisfied for one cycle of 
the foreach loop; 



Algorithm 1: The Shuffling Algorithm 

Although swapping the elements for one user may affect the 
simplified columns for other users, in each swap, we always 
increase the number of simplified columns with two identical 
elements, i.e., ["] . As there are only a finite number of columns, 
the algorithm will always terminate after at most kf cycles of 
the foreach loop. 



We will now show that when the algorithm ends, user a g 
{2, 3, . . . , L} can decode the messages on row a, denoted by 
Wa (consisting of A' symbols). We treat [ a ] as ["] because 
only one copy of the same symbol is transmitted to the relay. 
When we swap the elements for each user in the algorithm, 
we always swap the top elements, i.e., symbols in the same 
row — the bottom elements may be from different rows. 

Consider the decoding of user a using the A' non-empty 
simplified columns. The user again rearranges each simplified 
column such that the top element takes the symbol in row a. 
Doing that, the top elements of these simplified columns are 
distinct — they corresponds to the A' symbols in W\. Now, if 
a symbol a appear at the top of a simplified column and the 
bottom of another, it has to take the form [ a ] , [f] >[„],••• > [£] , 
because all a can only appear once at the top, and the case 
[o] and [2] for any j3 ^ a and 7 7^ a has been eliminated. 
Consequently, user a can decode W\. This is true for all 
a £ {2,3, ... , L}. Combining this with Proposition [4] we have 
the following: 

Proposition 5: If a user can decode U, then it can decode 
all its intended messages. 

G. Rate Bounds for Decoding U 

Now, we derive the rates at which the users can transmit 
such that all users can reliably decode U. 

We first determine the condition for which the relay can 
reliably decode U. On the uplink, the users transmit kf aligned 
symbols in n channel uses across all blocks in Table [I] We 
allocate the number of channel uses for each block proportional 
to the message length, i.e., 



Tlx 



ki 



(22) 



for all Ie{2,3,...,L, ;'•./;,,,. (j..! /.; }• if 



Rf 



k?log 2 F = kxlog 2 F 
n nx 



<log 2 F-H(N ), (23) 



and if n is sufficiently large, then pTj ) is satisfied for all X, 
and thus the relay can reliably decode U. 

Now, suppose that the relay has successfully decoded U. 
Note that U = {U^\U^ 2 \. . . , U^) is a finite-field vector 
of length kf. Hence, there are at most F kl distinct vectors U. 
On the downlink, the relay first chooses some p(xq), generates 
F fc i sequences x , each of length n, and indexes them as 
X (u^\ u^ 2 \ . . . , u( fc i ') = Xo(u). Upon decoding U on 
the uplink, the relay transmits Xq(U) on the downlink. 

Each user a G {1, 2, . . . , L} attempts to decode U on the 
downlink with the help of its prior messages, {Wx '■ a £ I}. 
With the correct prior messages, all ambiguities in U can only 
be caused by different {Wx : a £ I}. For user a, let V a 
denotes the set of distinct u that can be formed by all possible 
{Wx :a<£T}. We have 

\V a \ < \{Wx :a£l}\ = F k * . (24) 

The set T) a contains all possible u user a may decode to. 



Knowing the messages {W x '■ a G eacn user a- € 
{1, 2, . . . , L} decodes U from its received downlink channel 
outputs Y a if it can find a unique vector u E T> a such that 

{X (u),Y a )£Ai n \X ,Y a ), (25) 

where Ae n ^(Xrj, Y a ) is the set of jointly typical sequences 
{(x , y a )} lfT3l p. 195]. Otherwise, user a declares a decoding 
error. So, user a makes an error in decoding if the event E\\JEi 
occurs, where 

• E\. the correct U E T> a does not satisfy (|25), 

• E2'. some wrong u' E T> a satisfies ( |2~5j ). 

By definition, U E T> a . From the joint asymptotic equiparti- 
tion property (JAEP), we know that ifPJl Thm. 7.6.1] 

Pr{£i} < e. (26) 

We now evaluate the probability of E%: 

PrjE'z} = Pr{some u' E V a satisfies ([25j> (27a) 

< PT i u ' satisnes <|25j} (27b) 

u>GT) a \{U} 

< (F fe » - i) 2 -»(J(*o;V.)-3e) (2?c) 

< 2 «([fcf log 2 F]/n-/(Xo;Y a )+3e)^ ^d) 

where ( |27c) follows from (f24| and the JAEP [13] Thm. 7.6.1]. 

This means by choosing a sufficiently small e and a 
sufficiently large n, if 

s= fcflofeF < 
n 

then Pr{i?iUi?2} < Pr{Ei}+Pr{E 2 } can be made arbitrarily 
small, meaning that user a can realibly decode U. 

Consequently, if ( f]"3") l and ( fT4] i are satisfied for all a e 
{1,2,..., L}, then all users can reliably decode U. It follows 
from Proposition [4] that each user can reliably decode its 
intended messages. With this, we have proven the achievability 
of Theorem [TJ 

Remark 2: In our proposed coding scheme, the relay trans- 
mits after decoding U. This means a total of 2n channel 
uses (n for the uplink, followed by n for the downlink). 
This issue can be easily fixed by repeating this scheme for 
multiple messages over multiple blocks. Using the uplink and 
the downlink simultaneously, the relay transmits U that it has 
previously decoded in the previous block, while, at the same 
time, the users transmits new messages. This is a commonly- 
used technique for relay channels (see, e.g., lfT4l ||5l). 

Appendix A 
Generalization of the Capacity Results for 
Arbitrary Downlink 



for all i E {1,2,..., L}, then the relay can reliably decode 
a function (denoted by U) of the users' messages. From 
eqns. (59),(60),(63f),(64e)], we know that if 

]T Rj < I(X ;Yi), (30) 
je{i,2,...,L}\{i} 

for all i € {1, 2, . . . , L} and for some p(xq), then each user i 
can reliably decode U, and subsequently obtain the other users' 
messages. This proves the achievability of rates satisfying ([3]). 
The converse follows from Section [II] by setting all Rij = 0. 
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We have previously established J5J Sec. VA] that if 



Rj <\og 2 F-H(N ), (29) 

j£{l,2,...,L}\{i} 



